UConn BioGrid REU 2008 SNP Individual Genotyping from Low-Coverage Sequencing Data

Authors

  • Sanjiv Dinakar
  • Yözen Hernández
Abstract

Whole genomes can now be sequenced thanks to next-generation sequencing technologies. Cost-effective sequencing of individual genomes would make it possible for genetic analysis to become commonplace, particularly for medical applications. However, sequencing data is only useful if information of value can be reliably extracted from it. Single nucleotide polymorphism (SNP) genotyping from low-coverage sequencing data is a problem of particular importance. Here, we describe methods we have explored to produce accurate and reliable SNP genotypes from relatively low-coverage data obtained from newer shotgun sequencing technologies, in conjunction with information from reference panels such as the HapMap project.

Similar resources

Potential of low-coverage genotyping-by-sequencing and imputation for cost-effective genomic selection in bi-parental segregating populations

Genotyping-by-sequencing (GBS) is an alternative genotyping method to single-nucleotide polymorphism (SNP) arrays that has received considerable attention in the plant breeding community. In this study we use simulation to quantify the potential of low-coverage GBS and imputation for cost-effective genomic selection in bi-parental segregating populations. The simulations comprised a range of scena...

Increasing Genome Sampling and Improving SNP Genotyping for Genotyping-by-Sequencing with New Combinations of Restriction Enzymes

Genotyping-by-sequencing (GBS) has emerged as a useful genomic approach for exploring genome-wide genetic variation. However, GBS commonly samples a genome unevenly and can generate a substantial amount of missing data. These technical features would limit the power of various GBS-based genetic and genomic analyses. Here we present software called IgCoverage for in silico evaluation of genomic ...

An integrative variant analysis pipeline for accurate genotype/haplotype inference in population NGS data.

Next-generation sequencing is a powerful approach for discovering genetic variation. Sensitive variant calling and haplotype inference from population sequencing data remain challenging. We describe methods for high-quality discovery, genotyping, and phasing of SNPs for low-coverage (approximately 5×) sequencing of populations, implemented in a pipeline called SNPTools. Our pipeline contains se...

SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data

Motivation: Genotyping and parameter estimation using high throughput sequencing data are everyday tasks for population geneticists, but methods developed for diploids are typically not applicable to polyploid taxa. This is due to their duplicated chromosomes, as well as the complex patterns of allelic exchange that often accompany whole genome duplication (WGD) events. For WGDs within a single ...

Reveel: large-scale population genotyping using low-coverage sequencing data

Motivation: Population low-coverage whole-genome sequencing is rapidly emerging as a prominent approach for discovering genomic variation and genotyping a cohort. This approach combines substantially lower cost than full-coverage sequencing with whole-genome discovery of low-allele frequency variants, to an extent that is not possible with array genotyping or exome sequencing. However, a challen...

Publication date: 2008